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The developat«nt of an interactive error analysis 
systea for a highly interactive prograsing language coapiler is 
esplored. A project is underway at the University of Illinois to 
autoaate the teaching of eleaentary coaputer science prograaing 
language courses by utilizing the PLATO IV interactive coaputer 
systea. one goal of the project is to provide a highly interactive 
coapiler/interpreter that nill allow beginning prograaers to writer 
debug, and run fairly si aple prograas using newly-learned language 
constructs. The error analysis and diagnostic routines for the 
interactive coapiler are divided into two separate systeas. The first 
is concerned exclusively with the syntactic and seaantic errors 
associated with prograa preparation and entry into the coapiler. The 
second error systea Is concerned with the execution of the student *s 
prograa and the error analysis and debugging that is initiated by the 
detection of an execution error. (VH) 
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Introduction 



^ This paper is concerned with the developnient of an interactive 



error analysis system for a highly interactive pi-ogramndnc language 
compiler. Although the proposed system should have reasonably wide 
applicability to various time-sharing systems and interactive compilers, 
the followJnf^ Is a description nf thf» £nf»l,«! ^nd r*ort«!f.va4n4-<! of n 
particular compiler in which the error system is to be implemented. 

A project is undeirway at the University of Illinois at Urbana- 
Caiompoign to automate the teaching of elementary computer science program- 
ming lang\iage courses by utilizing the FIATO IV interactive computer system 
[2], One goal of this pro.iect is to provide a hichly interactive compiler/ 
interpreter thvit will allow beginning programmers to wi'ite, debi:g, and 
run fairly simple programs using newly -learned language constructs [h], 
A requirement for the compiler is that it be able to pjirse and diagnose 
subsets of different programming languages (currently, Pl/l, FCHTRAN, BASIC, 
COBOL) with a minimal amount of redesign required. To change to a new 
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laTij^i;:!^^^, unly the now luxicna aihl iiy!il:u'lii'ul 1 -iblcit muni ha r t\k\- if-.iuvl 
(a relatively strali^hfc-rorwiu'd proeosfs); the bul>. of the compiler's on-line 
editinj;; capabllitifs and internal structure can rctnaln unchanf;ed. 

The overwheljning Goa.1 and emplitiiSis for this compiler is that it 
behave like a "consultant", that is, bo able to dlJ^^mose proni'ajjimins errors 
and help the student understand his errors. A good description of this 
"co«r;taiant" or "tutor" capability is that of a diagnostic system tliat 
points out different interpretations or ways of correcting an error situa- 
tion, e.g., a diagnostic "prompter". T)ie goal is not to have the system 
\t tempt io "recover" from the student's errors, but to interactively infom 
the student of an error and attempt to prompt him with sugcestions about 
ways of correcting the error, requiring the student to actually analyze 
the situation and repair the prc^ran. 

ThG error analysis and diagnostic routines for our interactive 
compiler are divided into two separate systems. The first is concerned 
exclusively with the syntactic and semantic errors associated with program 
preparation and entrj'' into the compiler FsK The second error system is 
conccrncxi with the execution of the student's program and the error nnalysis 
and dobu£:gin2 that is initiated by the detection of an execution err<>r [l]. 
Tlic rctnaindcr of this paper will discuss these two error analysis systems. 



The Compilc-thno Krror AiiMly r.in ruicl Din^noflfclc Svstc?fli 

The cnviromiont in wMch Uie oompilc-tiicc error analysis syfstom 
is invoked is the followin(^. As the student enters the procrara, each 
word or "token" that is typed in is exaadncd iratiediatoly by the syntax 
parser. As soon as an error is detected, the error routine is entered; 
this routine attempts to explain the error to the student, as described 
later in this pcipcr, IVhen tlie student understaiids the error, he must 
back up in the input string and repair tlie error appropriately; he is 
then allowed to continue enter inc the remainder of the prograta. 

The overall philosophy of the approach developed for this error 
system is that effective error diagnostics can be given by informing the 
student about the current internal state of the compiler. Of interest 
to the student might be symbol table information, such as the attributes 
of a variable or whether or not a particular word is a reserved (pre- 
defined) vord; dope vector information about defined arrays in the 
procram; and syntactic requirements, such as what symbols the compiler is 
able to accept at a particular place in the user*s program. The assump- 
tion is that if the compiler's current internal state can be adequately 
understood by the stttdent, then the sjmtactic error (or semantic error, 
such as ti-yinc to re-declare an identifier in PL/l) will become apparent 
and the student can correct the program in the proper manner. 

Tlie problem in trying to iniplement this philosophy is that it is 
virtually impossible to know exactly what Irnigua^^ie construct the student 
tho\ight he was using when an error is detected. Different parts of the 
compiler *s state information may be importajit to the student in different 
error situations; also, different students in an identical cjTor 



situation may very well need different information to \mdorstand thcii* 
error. In an effort to provide the proper information for most situations, 
this paper sii^ijccts nn interactive diagnostic system that tries to present 
to the student "intcllicont guesses'* as to the cause of the error, Uliese 
"nuesces" can he determined by examiiiitic the current input string, the 
syntactic rcquirernonts, the symboj. table, and any other information compris 
inc. the state of the compiler; any discrepancies found in the state, 
information can be reported to tlie student as a possible reason for the 
error (note tluat there will always be at least one discrepancy, since on 
error orisinally detected) . A given error situation may cause three 
of four (or more) aystora-senorated "gueascs'* to be p^esented; the hope is 
that at least one of those "guesses" will be close :>.ough to the a^itual 
error to reveal the error to the student. 

It is instructive to consider a fc.-r wco-tiples ..of sjTitactic/ 
semantic error situations and some appropriate diagnostic messages that 
ai'e generated. All the examples are taken from the Pl/I language, which 
has been used for our first impleraentation of this diagnostic system. Work 
is in progress to extend the system to PORmAH, Since the compiler is 
intended for elementary programmers, only a subset of the most common 
features are accepted (in particular, for these examples the conditional 
expression in nn "IF** stateinent is assumed to be a relational- type 
expression). 
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Kxamplo 1 i 

Assume the following strinf^ is input oo the compiler: 
IF A - B/C Tim 
An initial din^^nostic mcsnnge is f^lven: 

IF A - B/C 



■^*^f5«^«-*x-**'c*R)ssiblc norroction«f-'«^'H('Hf*x*** 
Replace "TIIEN" with aii arithmetic operator. 
If the student requests further analysis, another niessat^e is: 

IF A - B/C [mm] 

intxHHtJf-KvxfrK-itiiosslble Cor r ec t ion*^^ x x x Kit y 

» 

Replace "TIIEIf" with a relational operator. 
A further request for annl^vsis would yield: 

IF 



A - B/C 



THEN 

Replace this arithmetic expression with a conditional expression. 

Kxmnple 2 t 

Assume the following string is inpiit to the compiler: 
I = J * 10 TO 
ftie messiige that is given is: 

I = J * 10 



TO 



*>{-)Hf«iHf-*w*i^g5iblc Correction »-'<-«^-^*'HHHHf* 
Replace "TO" with an arithmetic operator or a 
Another message is: 

01 = J * 10 TO 
iHe**iHt4tK* x X;''-possiblG Correct ion 
Insert "DO" in front of the identifier "l". 



6 



Pmn thono f^xninplcn it in app-ircnt that the error system in 
tryiru; to nivo Uie student well-dhTctcd hints about possible causcc for 
the error fiituvilion. Note alao that these mecsacf^n first make sxicj-^estions 
about possible "local" corrnctionn to the progrnn, assuminc that evory- 
thini; thnt vaa Input before t!ie token at which the eriror was detected is 
correct. If the student roquestn further suti^f'stions, then the error 
analysis sy^^tetH considers coi-rections of text to the left of the error 
token. Besides being perhaps the nK3st natural order of analysis, this 
methcid also is cood. from .a coinputaticnal point of view: as more and more 
left con^:cxt is considered, the anK>unt of computing required to determine 
a possible siif^fiostion increases rapxdl;^'; the hope is that the first few 
initial messages will cause the student to discover the error, and that 
the more expensive suggestions, while always available when needed, will 

Uol> hv xt:«iulic;J mUcL Of the txhicr. 

Consider another example: 
Example 3 : 

Assume that the follovring declaration is made at the beginning 
of the program: 

DIXLARE A (10), B; 
Then suppose that the student later types in: 

B = A ; 

One messa^je that is piven is: 

B = A0 

•>Me-if**HMH<-iHe-K-*i\,s n ible Correct ion 

Insert the az'ray subscripts in front of the "j". 
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*?H-iH^*«^» K XX ^rossiblc Correct ion*)^* 

Replace array reference "A" vith an ordinary identifier. 

Tiiis example illustrates another feature of the diagnostic 
system. The analysis routines and corresixjnding dia^jnostic messacos are 
able to manipulate and display infi^smiation about important langxiage con- 
structn, such as '*array subscripts", "expressions", "statements", etc. 
These hinh-lcvel constructs are cm: .zed durinc the teaching of a 
prosraiPiiinr!; 1an{;iiag:e and should be referred to at appropriate times by 
the dia^tnostic system. It is essent'al that the diafjnostic system be 
capable of Biaking sugccstions using terminology that is understandable to 
the student. 

One final requirement for the cotnpile-time diagnostic system is 
that as much of this "prompter" as is possible be built into the compiler 
and provided automatically for different programming lanj^uages. The 
to achieving this is to have the diagnostic system be driven by the syntax ' 
parser table and compiler symbol tables, !Ehen, constructing canpiler 
parser and symbol tabJes for a new lanc«age wil3 be sufficient to allow 
the proper operation of the interactive dia^^ostic prompter. 

Let's now examine the error analysis and diagnostic system that 
is used to assist the sti:dent with program execution errors. 



For in-i;\:ram execution, it is assumed tliat the proftrrim being 
Civ<jn to tliG execution j>acka{^o Is a «ynta£jtieal3y correct procrtm for 
the lan|riiaf,o beinc written. This is insured by the proccai? of ijninediate 
detection and anilysis of syntnx errors as described in the previous 
section. The run-time supervisor is an interpreter which intei'preta a 
tokenizcd version of the original source language program. During 
interpretation of each statement, error conditions are tested and, if 
locntrd, an appropriate error message is displayed to the student. If 
the student requests he3p, control then passes to the execution'time 
error analysis program. 

The error analysis program scales the student's profjrain looking 
for conditions which it knars are coiranon causes of errors. Note the 
distinction between an error and a cause of on error. An execution error 
is any inegal condition which the interpreter can find by examininc 
various data stnictures in the run-time environment (e.g., subscript out 
of ranee). The cause of an error is tlie mistake in logic or in the use 
of a programming lajiguase feature that residted in that error (e.g., 
improper declaration of an array). Tlie scan for tJicso causes is done 
by reverse executing the prc^^ram, statement by statement, starting at the 
position of the error. This reverse execution insures that the analysis 
routines are examininf; eau-h statement in the same execution environment 
in which it was executed during forward execution. 

When a potential cause for an error is located, a discussion 
with the student is initiated conccmins that condition. For example, 
when the following simple Pl/l program is exectited, the indicated error 
is detected: 



STRlf4<;.SUBSTR 'STPIfii:. j 



If the student typec IIELP, then the error analysis system commences its 
reverse execution, l^hen a statcrocnt is located vhich is relevant to the 
error in question, Uiat statement if f lapsed and broiif.ht to the attention 
of the student: 



STRING. 'f^CC J 

' 'fRINCi«SLff*£TK k;> I k'If*i , ■ 




POSITIW <T CNCOJTTOfi CTW 



Thia otatenvsnt gave PCS an tr>corr«ct value of 



0. 



30 



Next, thnt staUement Sr. t»K.'iminod for postti Wo Condi tiona vhich ml^hl: hnvc 
caiifit-d the cri'or. In Ihir, pfirtJcttl:*!' caxjc, a built-in function, INDKX, 
haa returnea a ^peci.-d-cai^c rocult. T\\U fact, if doscrib{>d to the 
student nay help tlie studc?nt to undcrntond wli^ liia error has occurred; 



OECLfiRE STPIt4G WiRdS} :V*¥rrING, .POS.rtXEDj 




XNDEIX returned a value of sero here bo-rause the 
fttrtna; 'X' could T>ot b-s lound an>"wh«re in the string 



If the student now understands what caused his error, ho may choose to 
edit and re-cxecute his jirocrmn. If he chooses to rcqtjcst more help, 
the values of the operands involved ai*e discussed and the student Informs 
the analysii? procrnm wJiothor they soein reasonnb^G of not J 



n 



CCaWE STRING m-«(it»J .VrftiLi^G, FOS FiXCDt 

.STRING- -r^eCOE'.- '"^r 

POS» INCCN tf=>TP I^i^. • X ' ) . 

\ ■ , . . 

« «*r-i 1 T POSITI0?j Of C>£CUTIOfj C PRc^ 

Thia »tatef>»ent POS an incorr-.-ct ^alue of g, 




?3 a value of "hfiCCC reascnabl- for SWit^ 

If you are nc-t steslwisii: r-.-^-i tiv«, an-*ver no* 



If the student now ronponds by sc^yinG that all the values are indeed 
correct tlion tlio present exi)rcsslon being exHniined must be incorrect: 



EXftTPLi: PRocmjSE: opaofs rmiN) j 

1 .KTCLflRE STRir^ CHWdO) . Vt=iFrrl*4G, PCS FIXED? 

STR ING- 'ASCDE' ; 

PCS« {irjrcwtt'TPifjC. ■■/.''p I 



END: 




Thia stat«»nent »&»ve/ 



POSITIOfj or rXECUTIOTj ERROR 



an incorrect value of 
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On the assunfrtion that ait of ^*our previous 
retir-c-nr.e!& <.>^r*? trtu*^ tl-H* m^i < <^«pr<??5ton 

that S'ou e^v ti»!»t )tl v.^ri.t^l^s CvM<t«<trvi t^athin 
tt ccntatn c-trr^t values, k<.it tfv* 'i^rt m^t ion 
variable mav tKt crnt^tn t\f^ r^^ultmj: v.^lue. 



IP 



Tf tho r>1-utoiit h-ul rtM}xnult\t by s:ny!ii,.': tljal. k<ii:io ct' the Nf^ivtnb.lon 
hn.l qnci?t.iotnb.lo v:iliic;:, Mion ovror analy;i5n would continue in l.ho rovofi^c 
-llroclion looHInr t\ir r.i rH orunils vhK.'h t^frcot thoije new VMriablcs*. 

It. is hopod Uinl this aimpllf cxariplo doinonstratcn l.ho t..\7i«^ of 
nnalyiJiii j>crrorr:iod. By rMUr^ tJu? student m«:unmia iniG33 icont cotmcnts 
ribcmt ixr-^i^.lonr. In hlo tnVi:rnn! wldi'h oi'f relevant to the execution onw, 
it is hopt'>d timt the st\ulont will eoinprehcnd Mhy his ei'ror occurred. As 
dni'Inn our;ip51o-tiir.o error analy^sls, nun^erouc sue.;r:e£5tions are ir:ade to the 
f?f.u«1f>nis, but it is they who mtist. decide how to correct their procraras 
since only they cim know wiiat their pro^^rasns wore siipposod to do. 
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Tn f.u'iuK'u'y , thl:? p:itH->f' Ji.'ij'. tl;u?v*tu?:^i\l an tijtnroaeli to p!*o,>';rjur. e^vror 
;uinly;;j;^ t.h-y\ ov.:p\vurrM-^ tho fU'tivatio;-; of a pro!^,ra!?i"dn('; lnii,'.-cnMf-o "tutor" 
or 'Vonr.n.Uniil". Wlien a CD!r.}unv-t..Ii;ie ii^ut.-u!tlc error ia dctocLt\l, the 
d.in;-:nos-J,.u' syi?l>c;?- trior, to "protnpt" Iho cttv.dcnt into analysijif; tlu> error 
nitnatitui hy f-ivin.f t}ie stisdont hints in tlie form of "possible correction.^" 
to tlic p-".>j:i\'vjti. For :in execution error, the din,.-^nor.tio syntem trios to 
direct t.}ic r,t.iident .tn imalynint^ the erj-or ^^'itiu'tion by showing how various 
roTovnnt aoctions of the pro^rra::! were acturJly exocntoiU 

At the prcr.ont time, the two diagnostic i3yr4-.eiic described have 
been it:.plof!;entt.\l in prototype form in the interactive (compiler on tlie 
riATO IV coir.puter-aidt^d instruction cystcm, Vtork is -in prot^ress to refine 
the injtin.1 verr.ions and to provide more complete tciTisinology for coninuni- 
catln;^ vith the stinlcnt. 
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